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IMAGE PROCESSING METHOD AND IMAGE PROCESSING APPARATUS 



Technical Field 

[0001] The present invention relates to an image processing method and 

apparatus, and more particularly, to a method and apparatus for eliminating a 
line-shaped image object from a moving image object overlapping with each 
other in a single image. 



Background Art 

[0002] Recently, as information communication technologies such as the 

Internet and the like make progress, contents of digital video information, 
digital image information, and the like rapidly increase in various fields. 
[0003] For example, to effectively and accurately search these contents, 

videos and images must be analyzed by a computer. However, there are 
many cases in which two or more objects are recorded in an overlapping state 
in one image captured. 

[0004] When the motion of one target object is traced and analyzed, the 

overlapping state acts as interference. Consequently, there is required a 
technology for separating two objects that overlap particularly in a binary 
image. 

[0005] This technology has been researched particularly in the field of 

OCR (optical character recognition). As disclosed in, for example, 
PCT/US96/04036, there are known a method and an apparatus for restoring 
obscure portions of an image composed of continuous characters or other 
patterns when overlapping lines exist therein. 
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[0006] The method proposes, as a method of restoring a lost portion of a 

circulating pattern, for example characters, in an image composed of pixels, 
includes a step of determining an effective pixel value in the vicinity of the 
lost portion of the circulating pattern and a step of restoring the effective pixel 
value which is a value corresponding to pixels including the lost portion of the 
circulating pattern by using at least a part of data stored in a double 
stochastic finite state machine pretrained with the circulating pattern. 
[0007] Although the method is effective in a pattern in which text and 

the like circulatingly appear, it is not suitable for contents such as videos that 
must be processed at high speed because processing executed by the method is 
complex, in addition to that it is difficult to use the method in an ordinary 
image. 

[0008] That is, in a video in which a player plays a sport on the ground 

on which, for example, lines are drawn and further a net is disposed, it is 
contemplated that the player overlaps with the lines and the net in almost all 
the times. In this case, the conventional method cannot be used in a scene in 
which the motion of the player is to be analyzed. Further, since the player 
moves at high speed, high speed processing must be realized. There is 
conventionally proposed no technology that can effectively cope with the above 
case. 



[0009] 



Disclosure of the Invention 

The present invention is created based on the conventional 
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background described above, is a technology for permitting two overlapping 
objects included in an image to be separated and extracted at high speed, and 
is particularly suitable to a case in which one object is a line-shaped image 
object. The present invention will be sequentially disclosed below. 
[0010] That is, in an image processing method of eliminating a line- 

shaped image object, which overlaps with a moving image object in one image 
comprising effective or ineffective pixels, from the moving image object, the 
method comprises the respective steps of a line segment extraction step for 
extracting a line segment from the line-shaped image object by a line segment 
extraction means, an elimination step for eliminating the line shaped image 
object from the moving image object by a line-shaped image elimination 
means, a pixel extraction step for scanning a vicinity region of the line 
segment on the moving image object and sequentially extracting pixels to be 
scanned by an image scan means, an effective pixel determination step for 
determining whether or not the extracted pixels to be scanned are effective 
pixels by an effective pixel determination means, and a pixel interpolation 
step for dropping a perpendicular from the pixels to be scanned that are 
determined to be the effective pixels at the effective pixel determination step 
to a nearest line segment and setting all the pixels on the perpendicular as the 
effective pixels by a pixel interpolation means. 

[0011] Further, the image may be one frame in a moving image object 

comprising a plurality of frames. 

[0012] Otherwise, the image may be an image obtained by subjecting a 

single frame or plural frames in the moving image comprising the plurality of 
frames to predetermined arithmetic processing. In this arrangement, the 
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arithmetic processing may be any one of processing for determining a 
difference between two arbitrary frames in the moving image or processing for 
determining a change region in one arbitrary frame in the moving image 
object. 

[0013] Further, the processing for determining the change region in the 

one arbitrary frame in the moving image object may be processing for 
extracting predetermined frames before and after the one frame and obtaining 
difference images between each predetermined frame and the one frame, 
respectively as well as executing an ANDing operation of both of the difference 
images. 

[0014] In the above arrangement, a line segment may be extracted using 

the Hough transform at the line segment extraction step. 

[0015] Further, the present invention can also provide the following 

image processing apparatus for eliminating a line-shaped image object, which 
overlaps with a moving image object in a single image comprising effective or 
ineffective pixels, from the moving image object. 

[0016] That is, the apparatus is characterized by comprising a line 

segment extraction means for extracting a line segment from the line-shaped 
image object, a line-shaped image elimination means for eliminating the line- 
shaped image object from the moving image object, an image scan means for 
scanning a vicinity region of the line segment on the moving image object and 
sequentially extracting pixels to be scanned, an effective pixel determination 
means for determining whether or not the extracted pixels to be scanned are 
effective pixels, and a pixel interpolation means for dropping a perpendicular 
from the pixels to be scanned that are determined to be the effective pixels at 



Application No. 1 0/550, 896 Attorney Docket No. 4035-01 7 5 PUS 1 

Substitute Specification Art Unit: 2624 

Reply to Office Action dated May J, 2008 Page 5 of 17 

the effective pixel determination step to a nearest line segment and setting all 
the pixels on the perpendicular as the effective pixels. 

[0017] The image processing apparatus may comprise a frame 

extraction means for extracting a single frame or a plurality of frames from a 
moving image comprising a plurality of frames and a frame arithmetic 
processing means for subjecting an extracted frame to predetermined 
arithmetic processing and use a result of the arithmetic processing as the 
image. 

[0018] Further, the image processing apparatus may comprise a frame 

extraction means for extracting a single frame or a plurality of frames from a 
moving image comprising a plurality of frames and a frame arithmetic 
processing means for subjecting an extracted frame to predetermined 
arithmetic processing and uses a result of the arithmetic processing as the 
image. The frame arithmetic processing means may be arranged such that it 
executes any processing of processing for determining a difference between 
two arbitrary frames in the moving image object and processing for 
determining a change region in one arbitrary frame in the moving image 
object. 

[0019] Further, the processing for determining the change region in the 

one arbitrary frame in the moving image may use processing for extracting 
predetermined frames before and after the one frame and obtaining difference 
images between each predetermined frame and the one frame, respectively as 
well as executing an ANDing operation of both the difference images. 
[0020] In the image processing apparatus described above, the line 

segment extraction means may extract a line segment using the Hough 
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Brief Description of the Drawings 
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[0026J 


ing. 6 is a view explaining a court model of a tennis court. 


[0027] 


Fig. 7 is a view explaining a net model of a tennis court. 


[0028] 


Fig. 8 shows a binary image B c (t). 


[0029] 


Fig. 9 shows a binary image Bi(t). 


[0030] 


Fig. 10 shows a binary image E$2(t). 


[0031] 


Fig. 11 shows a binary image Bdiff(t). 


[0032] 


Fig. 12 shows a binary image Biabei(t). 


[0033] 


Fig. 13 shows a binary image L f c (t). 


[0034] 


Fig. 14 shows a binary image L f n (t). 


[0035] 


Fig. 15 is an enlarged view in the vicinity of a player after a line 



segment is deleted from B'dit^t). 

[0036] Fig. 16 is an enlarged view in the vicinity of the player after a 

line segment is deleted from BWiCt). 
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[0037] Fig. 17 is an enlarged view in the vicinity of the player of B f diff(t) 

after pixels are interpolated. 

[0038] Fig. 18 is an enlarged view in the vicinity of the player of B'iabei(t) 

after the pixels are interpolated. 

[0039] Fig. 19 is a view explaining how a player region is extracted. 

[0040] Reference numerals denote the following portions. 

[0041] 10- line segment extraction step, 11: line segment image 

elimination step, 12: step for scanning line segment vicinity region, 13: step 
for determining whether or not pixels to be scanned are effective pixels, 14: 
processing for dropping a perpendicular between pixels to be scanned and line 
segment, 15: step for converting all the points on the perpendicular into 
effective pixels, 30: line-shaped image object, 31: moving object, 32: pixels to 
be scanned 



Detailed Description of the Preferred Embodiments of the Invention 
[0042] A preferable method of embodying the present invention will be 

described referring to the drawings. Note that the embodiment of the 
present invention is not limited to the one described below and may be 
appropriately modified. 

[0043] Fig. 1 shows a flowchart of an image processing method of the 

present invention, and Fig. 2 shows an image processing apparatus 
(hereinafter called the apparatus) according to the present invention, 
respectively. 

[0044] The apparatus (l) can be composed of a known personal 
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computer that is composed of a CPU (2) for executing arithmetic processing, a 
RAM/ROM memory (3), an external storage unit (4) such as a hard disc and 
the like, an input unit (5) such as a keyboard, a mouse, and the like, a monitor 
(6) for display an output. 

[0045] In the apparatus (l), an image capture means (7) for inputting a 

video is connected to the CPU (2), in addition to the above units, so that a 
video signal from a not shown came and video replay device is recorded to the 
external storage unit (4). 

[0046] When the video signal is an analog signal, an analog/digital 

converting function is mounted on the image capture means (7), thereby 
digital data is recorded to the external storage unit (4). 

[0047] In a core portion of the present invention, an image captured by 

the image capture means (7) and recorded to the external storage unit (7) is 
subjected to processing as shown in Fig. 1. 

[0048] First, a line segment extraction unit (22) of the CPU (2) extracts 

a line segment from a line-shaped object (30) in the image at a step (10). 
Although any optional methods may be employed to separate the region of the 
line-shaped image object from the image, when, for example, a line-shaped 
image object has a predetermined shape, the line segment may be extracted 
according to the shape. 

[0049] Then, a line-shaped image elimination unit (23) enlarges the line 

segment obtained at the above step and removes the line segment from a 
portion in which at least a moving object (31) is included in the image (line 
segment image elimination step (ll)). It is needless to say that a line-shaped 
image object may be eliminated from the entire image. The eliminated image 
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can be stored in the memory (3). 

[0050] When the line segment image object is eliminated as described 

above, since the portion of the line segment image object (30) overlapping with 
the moving object (31) is eliminated, a band-shaped ineffective pixel portion is 
made on the intrinsic moving object. 

[0051] The present invention has the following steps to interpolate it 

effectively. 

[0052] For convenience's sake, Fig. 3 shows an example of the line- 

shaped image object (30), Fig. 4 shows an example of the moving object (31), 
and Fig. 5 shows a state in which the moving object (31) is segmentized at the 
elimination step (ll). In the respective figures, pixels with "1" show effective 
pixels, and pixels with "0" show ineffective pixels. 

[0053] In Fig. 3, band-shaped effective pixels (40) arranged in a 

horizontal direction are the line-shaped image object (30) as the line-shaped 

image object, and the other regions are ineffective pixels. Then, a line 

segment (41) is extracted at line segment extraction step (10). 

[0054] In contrast, as shown in Fig. 4, the moving object (31) is 

composed of a set of multiple pixels (50) ... expressed by "1". 

[0055] In an actual image, the line-shaped image object (30) overlaps 

with the moving object (31), and elimination of the line-shaped image object 

(30) results in a state shown in Fig. 5. That is, an ineffective pixel band (60) 

is made in the moving object (31), thereby the moving object (31) is 

segmentized. 

[0056] To interpolate the segmentation, in the present invention, the 

pixels in the vicinity of the line segment (41) are sequentially scanned by an 
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image scan unit (24) of the CPU (2), and the pixels within a predetermined 
threshold value are extracted. 

[0057] At the time, when the line segment (41) is enlarged to a line 

width of three pixels at the line segment image elimination step (ll) (in the 
cases shown in Figs. 3 to 5), the pixels spaced apart from the line segment by 
two pixels may be extracted. The to be scanned pixels (32) extracted as 
described above can be also temporarily stored in the memory (3). 
[0058] Further, as shown in Fig. 6, the pixels (42) on the line segment 

(41) may be sequentially scanned, screen coordinates (44), (44), (45), (45) (in 
this case, four types of screen coordinates exist to a single pixel (42)) may be 
determined by adding and subtracting a value, which is obtained by adding 1 
to a size (43) one half the line width, to and from the screen coordinate of the 
pixel (42), which is obtained each time the pixels (42) are scanned, in the 
respective directions of x- and y-directions, and the pixels of the screen 
coordinates may be used as the pixels to be scanned (32). 

[0059] Then, an effective pixel determination unit (25) determines 

whether or not the extracted pixels are effective pixels at determination step 
(13). 

[0060] As a result, when the pixels are effective pixels (having 

information of "1"), a pixel interpolation unit (26) drops a virtual 
perpendicular between the pixels and the line segment (41) at step (14) and 
converts all the ineffective pixels on the perpendicular into effective pixels at 
step (15) sequentially. Processing for making the ineffective pixels effective 
can be executed by rewriting the information of "0" allocated to the respective 
pixels to "1". 
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[0061] To specifically show the above-mentioned, when scanning is 

sequentially executed in Fig. 5 from a pixel (61), which is spaced apart from 
the line segment (41) by the distance of two pixels, to pixels (62), (63), ... , the 
pixel (61) cannot be scanned because it is an ineffective pixel. Since the 
image pixel (62) is an effective pixel, a perpendicular (64) is calculated, and 
pixels (65), (66) positioned on the perpendicular (64) are made to effective 
pixels. As apparent from the above description, the pixels lost by eliminating 
the line-shaped image object (30) can be interpolated by repeating the above 
process, thereby an image very near to the original moving object (31) can be 
obtained. 

[0062] A main portion of the present invention is as described above. 

Further, the following arrangement can be provided as an example to which 
the present invention is applied. 

[0063] That is, in the above description, the flow of processing is 

explained using the single image. However, the present invention can be 
most appropriately applied in particular to a video composed of a plurality of 
frames in time series, and the CPU (2) can suitably extract arbitrary one 
frame in the above frames through a frame extraction unit (20). The thus 
extracted one frame may be also processed as the above image. Further, as 
described later, a plurality of frames may be extracted. 

[0064] Further, the CPU (2) may be provided with a frame arithmetic 

processing unit (21) to subject a captured image to arithmetic processing 
according to a predetermined system. Since the arithmetic processing can be 
realized by a known image processing technology, an arithmetic operation 
method of the processing may be arbitrarily determined in the present 
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invention. 



[0065] 



Next, an embodiment of the present invention will be described 



using a player playing a game on a tennis court as an example. When a video 
of tennis is recorded, the player is recorded while overlapping with line- 
shaped image objects of court lines and net lines drawn on the ground, which 
interferes extraction of the movement of the player. Accordingly, in the 
present invention, the movement of the player is separated and extracted at 
high speed with pinpoint accuracy by the technology. 

[0066] First, the court and net lines are extracted as a line-shaped 

image object (30). At the time, a form of the tennis court is used. As shown 
in Fig. 6, court characteristic points P c i, ... , Pci4 and court lines L c i, ... , L C 9 
are defined. Likewise, net characteristic points P n i, ... , Pn3 and net lines L n i, 
Ln2 as shown in Fig. 7 are determined, and they are referred to as a court 
model and a net model. 

[0067] The original points of the respective models are set to the center 

of the court, and coordinate axes are determined as shown in the figure. The 
court model is defined on an XY plane, and the net model is defined on a ZX 
plane. 

[0068] First, the court lines are extracted. The court lines L c i, ... , L C 9 

in Fig. 6 are determined by detecting the court characteristic points at both 
the ends thereof. The process is composed of the following steps. 
[0069] At time t = 0, an initial characteristic point P c (0) is given as an 

input. Next, as to a court line L c (0) determined by P c (0), respective lines are 
converted into the Hough plane. There are prepared detection windows W c (0) 
which are centered on the respective peak points on the Hough plane and 
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have magnitudes Wth, W ro . 

[0070] At time t = t, first, a binary image B(t) of an original image is 

ANDed with the vicinity region of a court line L c (t ■ l) to thereby create a 
binary image (called a court line binary image) B c (t) composed only of a court 
vicinity. Fig. 8 shows the thus created image. The court line binary image 
shown in the figure and a net line binary image to be described later 
correspond to line-shaped image objects in an image that is referred to in the 
present invention. 

[0071] They are subjected to the Hough transform every line, peaks are 

detected within the range restricted by respective detection windows W c (t - l), 
and characteristic points P c (t) are updated. 

[0072] A court line L c (t) is subjected to the Hough transform again, and 

a detection window W c (t) is also updated. The process returns to step (ii). 
[0073] When the net line is extracted, an initial characteristic point 

P n (0) at time t = 0 is given as an input, and a net line L n (0) and a detection 
window W n (0) are prepared. Further, at time t = t, B(t) - B c (t), which is 
obtained by subtracting the court line binary image from the binary image of 
the original image, is ANDed with the vicinity region of a net line L n (t ■ 1) to 
thereby create a net line binary image B n (t). B n (t) is subjected to the Hough 
transform, peaks are detected within a detection window range, and a 
characteristic point P n (t) is updated. 

[0074] There is known a method called the Hough transform described 

above (for example, US Patent No. 3069654). In addition to the Hough 
transform, there is hardware for the Hough transform that is shown in 
"Hardware for the Hough Transform Using ROM" by Kunio Onda et al ('87 
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General Assembly Commemorative of 70th Anniversary of Institute of 
Electronics, Information and Communication Engineers, No. 1587). Further, 
there are also methods disclosed in Japanese Patent No. 2935863 and 
Japanese Patent Application Laid-Open Publication No. Hei"6-274619. Note 
that the Hough transform is variously improved as disclosed in Japanese 
Patent No. 2646363 and the like, and the modified methods may be arbitrarily 
used. 

[0075] In the present invention, the line segment extraction unit (2) of 

the CPU (2) executes the arithmetic processing of the Hough transform. 
[0076] Further, various known methods such as a straight line 

approximation method using a least squares method can be used to extract a 
line segment, in addition to the above method. Further, the present 
invention can be also applied to any line-shaped (band-shaped) objects such as 
circular, arc, polygonal, and other objects in addition to a straight line object 
as long as an image is a line-shaped image. 

[0077] Next, the following steps are executed to extract the player 

(moving object). 

[0078] The frame extraction unit (20) extracts a reference frame at time 

t = t and frames apart from the reference frame by a time s before and after 
the reference frame. Then, the differences between the front and rear frames 
and the reference frame, and binary images Bi(t) and B2(t) are created using 
an appropriate threshold value. The frame arithmetic processing unit (21) of 
the CPU (2) executes an arithmetic operation for determining the differences. 
[0079] As a method of determining the threshold value, any arbitrary 

methods may be used which include an Ohtsu f s method (method of 
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determining a threshold value by minimizing the dispersion in an average 
class), Kittler's method (method of determining a threshold value by 
minimizing entropy with average conditions), a region dividing method 
(method of dividing an image into several small regions and determining a 
threshold value most suitable for the property of each small region), and the 
like, in addition to a method of using a predetermined fixed value as a 
threshold value. 

[0080] Figs. 9 and 10 show the binary images Bi(t) and I$2(t) obtained 

here, respectively. 

[0081] (II) Fig. 11 shows a binary image obtained by ANDing the two 

difference images with each other as a binary image BdifKt). Change regions 
in one frame of an initial video (moving image) can be determined by the step. 
They can be subjected to arithmetic processing in the frame arithmetic 
processing unit (21) of the CPU (2). 

[0082] As the processing executed by the frame arithmetic processing 

unit (21), two arbitrary frames may be extracted by the frame extraction unit 
(20), and the difference between them may be determined and subjected to 
known processing by the frame arithmetic processing unit (21) as often used 
in image processing, in addition to the above processing. 

[0083] (III) Color clusters corresponding to typical colors of player's 

uniforms and the like are previously prepared, the pixels, which are included 
in the color clusters, of the pixels of an image I(t) at time t = t are set as 
effective pixels "1", thereby a binary image Bi a bei(t) is created as shown in Fig. 
12. A method of extracting the pixels included in the color clusters is known, 
and the resultant binary image Bi a bei(t) is an image obtained by subjecting a 
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single frame in a video to arithmetic processing executed by the frame 
arithmetic processing unit (21). 

[0084] (IV) Binary images obtained by enlarging court net lines L c (t), 

L n (t) are shown by L/ C (t), L f n (t) (shown in Figs. 13 and 14, respectively), and 
B^ifKt), B f iabei(t) are obtained by deleting these regions from BdifKt), Biabei(t). 
[0085] Figs. 15 and 16 show enlarged views in the vicinity of the player 

after they are deleted. As described above, in the present invention, as to a 
region from which a portion overlapping with a player region is deleted by the 
delete processing, the vicinity regions of L ? c (t), L f n (t) are scanned on B'diff(t), 
B f iabei(t). When the pixels l p to be scanned on B ? diff(t), B'iabei(t) are effective 
pixels, the pixel group l pv existing up to the foot of a perpendicular dropped 
from l p to L c (t) or L n (t) are made to effective pixels. 

[0086] Figs. 17 and 18 show enlarged views of the vicinity of the player 

of B f diff(t), B f iabei(t) obtained by the processing, respectively. 

[0087] (V) Finally, the two images are ORed with each other and made 

into a binary image B ca nd(t). A reason why the OR operation is executed in 
place of the AND operation is to extract the regions of the hands and foots of 
the player as stably as possible, even if they move. When the player's region 
is extracted only by the simple color labeling, it may be difficult to stably 
extract the player's region at the time dissolution is insufficient or the color of 
the player uniform is not almost different from a background color. Further, 
when only time subtraction images are used, a search is often missed when 
the player moves slowly. The OR operation is executed here to stably extract 
the player region in more detail while interpolating the component of a motion 
by the time subtraction images. 
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[0088] As a result of the above steps, the player is extracted as shown in 

Fig. 19 by setting a region having a predetermined area or more in the binary 
image B ca nd(t) as a player initial position. 

[0089] It should be noted that noise processing is executed in Fig. 19, 

and, in the embodiment of the present invention, the image processing can be 
executed by appropriately using the CPU (2). 

[0090] As described above in detail, according to the present invention, 

there can be provided the image processing technology for eliminating a line- 
shaped image object, which overlaps with a moving image object in one image 
comprising effective or ineffective pixels, from the moving image object. 
Since the arithmetic processing executed in the above method is simple, the 
method is very effective when there are many frames as in a video and each 
one frame must be processed at high speed. 

[0091] The invention being thus described, it will be obvious that the 

same may be varied in many ways. Such variations are not to be regarded as 
a departure from the spirit and scope of the invention, and all such 
modifications as would be obvious to one skilled in the art are intended to be 
included within the scope of the following claims. 



